PETSC: pattern-based embedding for time series classification

نویسندگان

چکیده

Efficient and interpretable classification of time series is an essential data mining task with many real-world applications. Recently several dictionary- shapelet-based methods have been proposed that employ contiguous subsequences fixed length. We extend pattern to efficiently enumerate long variable-length sequential patterns gaps. Additionally, we discover at multiple resolutions thereby combining cohesive vary in length, duration resolution. For construct embedding based on occurrences learn a linear model. The discovered form the basis for insight into each class series. pattern-based (PETSC) supports both univariate multivariate datasets varying length subject noise or missing data. experimentally validate MR-PETSC performs significantly better than baseline such as DTW, BOP SAX-VSM On series, our method comparably recent methods, including BOSS, cBOSS, S-BOSS, ProximityForest ResNET, only narrowly outperformed by state-of-the-art HIVE-COTE, ROCKET, TS-CHIEF InceptionTime. Moreover, PETSC current CIF none which are interpretable. scales large total training making predictions all 85 ‘bake off’ UCR archive under 3 h it one fastest available. particularly useful learns model where feature represents domain, human oversight ensure trustworthy fair financial, medical bioinformatics

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Embedding Time Series Data for Classification

We propose an approach to embed time series data in a vector space based on the distances obtained from Dynamic Time Warping (DTW), and to classify them in the embedded space. Under the problem setting in which both labeled data and unlabeled data are given beforehand, we consider three embeddings, embedding in a Euclidean space by MDS, embedding in a Pseudo-Euclidean space, and embedding in a ...

متن کامل

Model Based Method for Determining the Minimum Embedding Dimension from Solar Activity Chaotic Time Series

Predicting future behavior of chaotic time series system is a challenging area in the literature of nonlinear systems. The prediction's accuracy of chaotic time series is extremely dependent on the model and the learning algorithm. On the other hand the cyclic solar activity as one of the natural chaotic systems has significant effects on earth, climate, satellites and space missions. Several m...

متن کامل

RPM: Representative Pattern Mining for Efficient Time Series Classification

Time series classification is an important problem that has received a great amount of attention by researchers and practitioners in the past two decades. In this work, we propose a novel algorithm for time series classification based on the discovery of class-specific representative patterns. We define representative patterns of a class as a set of subsequences that has the greatest discrimina...

متن کامل

Discovering Key Sequences in Time Series Data for Pattern Classification

This paper addresses the issue of discovering key sequences from time series data for pattern classification. The aim is to find from a symbolic database all sequences that are both indicative and non-redundant. A sequence as such is called a key sequence in the paper. In order to solve this problem we first we establish criteria to evaluate sequences in terms of the measures of evaluation base...

متن کامل

Transformation Based Ensembles for Time Series Classification

Until recently, the vast majority of data mining time series classification (TSC) research has focused on alternative distance measures for 1-Nearest Neighbour (1-NN) classifiers based on either the raw data, or on compressions or smoothing of the raw data. Despite the extensive evidence in favour of 1-NN classifiers with Euclidean or Dynamic Time Warping distance, there has also been a flurry ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data Mining and Knowledge Discovery

سال: 2022

ISSN: ['1573-756X', '1384-5810']

DOI: https://doi.org/10.1007/s10618-022-00822-7